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Maximum stellar mass versus cluster membership number revisited 
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ABSTRACT 

We have made a new compilation of observations of maximum stellar mass versus cluster 
membership number from the literature, which we analyse for consistency with the predic- 
tions of a simple random drawing hypothesis for stellar mass selection in clusters. Previously, 
Weidner and Kroupa have suggested that the maximum stellar mass is lower, in low mass 
clusters, than would be expected on the basis of random drawing, and have pointed out that 
this could have important implications for steepening the integrated initial mass function of 
the Galaxy (the IGIMF) at high masses. Our compilation demonstrates how the observed dis- 
tribution in the plane of maximum stellar mass versus membership number is affected by the 
method of target selection; in particular, rather low n clusters with large maximum stellar 
masses are abundant in observational datasets that specifically seek clusters in the environs 
of high mass stars. Although we do not consider our compilation to be either complete or 
unbiased, we discuss the method by which such data should be statistically analysed. Our 
very provisional conclusion is that the data is not indicating any striking deviation from the 
expectations of random drawing. 



Key words: 

clusters 



1 INTRODUCTION 



stars: mass function - galaxies: stellar content - stars: formation - galaxies: star 



and conclude (Section 5) with an appeal for further observational 
information to be used in future analyses. 



It is well known (fo llowing IWeidner & Kroupj | |2004 l2006l) and 
lOev & Clarkj ( |2005|) ) that in the case of clusters containing fewer 
than ~ 100 OB stars (i.e. those with mass < a few x IC* M©) 
the maximum stellar mass increases with cluster mass. At higher 
cluster mass scales, the value of the maximum stellar mass satu- 
rates at around 150-200 Mg for re asons that are not entirely clear 
rseee.e. l Zmnecker&a B . In this paper, we restrict our- 
selves to considering the lower mass regime. In Section 2 we re- 
view why the statistics of maximum stellar masses in clusters of 
various scales can place constraints on high mass star formation in 
a cluster context and how rather subtle differences in assumed al- 
gorithms for cluster building are imprinted on the integrated galac- 
tic IMF (the IGIMF). We emphasise that analysis of the statis- 
tics of maximum stellar mass versus cluster mass offers the best 
prospects for an observational determination of whether the IGIMF 
sho uld be different from that measured in ind ividual clusters (see 
e.g. lWeidner & KrouDj200^ : lElmegreerl2006' ). We also stress that 
competing algorithms can only be distinguished through proper sta- 
tistical analysis of the observed distributions and that selecting al- 
gorithms according to how they reproduce the mean trend can be 
misleading. In Section 3 we present a new (but in all likelihood 
still incomplete and biased) compilation of observational informa- 
tion on this issue and highlight the sensitivity of the distribution 
obtained to the method of target selection. In Section 4 we discuss 
the statistical inferences that can be drawn from the current dataset 



2 THE IMPORTANCE OF MAXIMUM STELLAR MASS 
DATA AND ITS STATISTICAL ANALYSIS 

The simplest interpretation of the fact that the maximum stellar 
mass is lower, on average, in lower mass clusters is that this just 
derives from the statistics of random sampling. To take a simple 
analogy, the average height of the tallest inhabitants of large cities 
is likely to be greater, on average, than the average height of the 
tallest individuals in small villages. It would however to be incor- 
rect to infer from this that there is, for example, a nutritional defi- 
ciency among village dwellers. On the other hand, a better analogy 
might be with the wealth of richest individuals in settlements of 
various sizes, since in this case this might reflect the size of the 
local economic base. This is the sort of argument used by Weid- 
ner & Kroupa, who point out that in the case of cluster formation, 
the stars acquire their mass directly from the available gas reser- 
voir Their simple Monte-Carlo simulations build the finite size of 
the gas reservoir into their algorithms for stellar mass selection and 
reject any star whose formation causes the total designated cluster 
mass to be exceeded. This 'rejection' element preferentially affects 
more massive stars and is chiefly manifest through a statistical low- 
ering of the maximum stellar mass compared with its value in ran- 
dom sampling experiments. Another plausible algorithm was pro- 
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a. 




Figure 1. Probability density of the most massive star, p(mniax) (eo. lASI 
in Appendix |A) for a star cluster containing n = 30 stars. Characteris- 
tic quantities are the mean, rrtmax. and the median, mi/2 - The l/6th and 
5/6th quantiles limit the shaded region containing 2 / 3rd of the most mas- 
sive stars. 



posed bv lElmegreenI ( |2006|) . motivated by the fact that the fraction 
of the initial gas in a protocluster that ends up in stars may be signif- 
icantly less than unity. In this algorithm, therefore, although stellar 
mass selection is terminated once the total stellar mass exceeds the 
designated total mass, the last star is only rejected if this takes the 
total cluster mass over a value equal to the sum of the designated 
cluster mass and the mass of an additional gas reservoir. Since this 
is a softer rejection criterion, this algorithm produces results that 
are closer to random sampling than a strictly mass constrained al- 
gorithm. 

Since in all these cases the maximum stellar mass follows 
quite a broad distribution (at fixed cluster scale), these differences 
cannot be discerned from a single observational datapoint. Instead 
it is necessary to compare observed distributions (at given cluster 
scale) with the results of simulations (or, in the case of the simplest, 
random drawing, hypothesis, with the results of analytic predic- 
tions). Some care is needed when considering the best property of 
the distribution that should be compared with observational data, as 
is demonstrated in Figure[T]which illustrates the probability density 
function of maximum stellar mass at given cluster mass scale in the 
case of the random drawing hypothesis. Clearly, this distribution is 
highly asymmetric, with the mean significantly exceeding the me- 
dian. This means that in the case of sparsely sampled observational 
data (i.e. not many clusters at given mass scale), the observed mean 
is likely to be lower than the true mean. A better approach is instead 
to compute the predicted distribution (as a function of cluster mass) 
and use a non-parametric method (e.g. a Kolomogorov-Smimov 
test) to compare these with the distribution of observational data- 
poi nts in the plane of cluster mass versus maximum stellar mass 
(see lOev & Clarke (2005) and Section 3). 

Thus far we have discussed the interpretation of these statis- 
tics in terms of what light they may shed on cluster formation (ob- 
viously the algorithms described above are simple 'toy models' but 
a clear signature in favour of one of them could be useful, for exam- 
ple, in determining how much stellar mass assignments are shaped 
by strict limitations in available gas supply). Another implication 
is purely empirical: we have stressed that the steepening of the up- 
per IMF (and the resulting reduction in maximum stellar mass) in 
the case of non-random mass selection algorithms are too subtle to 



be detectable in any given cluster (i.e. each cluster is statistically 
consistent with being drawn from the input IMF). However, when 
one combines the results of many clusters (i.e. - on the assump- 
tion that the galactic field is composed of dissolved clusters - if 
one turns an IMF into an IGIMF) the signature of algorithms that 
preferentially reject high mass stars is seen in a steepening of the 
IGIMF. This impor t ant in sight was firs t discussed in this way by 
iKroupa & Weidneil l l2003h (though see IVanbeverenI l |l982h for an 
earlier version of the argument). The reason why such star rejec- 
tion algorithms - which are only important in the lower mass clus- 
ters that we discuss here - have a discernible effect on the IGIMF is 
simply that, given the steepness of the observed cluster mass func- 
tion, low mass clusters make an important contribution to the galac- 
tic field. 

Naturally, a steeper IGIMF has implications for how a range 
of quantities (such as supernova rate or ionising luminosity or 
chemical enri c hmen t) relate to the galactic star formation rate. 
IWeidner et al. Il l200l also extended the argument by positing that 
similar considerations apply to star cluster maximum masses in 
galaxies of different masses. This means that in lower mass galax- 
ies, the field population would be more dominated by lower mass 
clusters, and therefore that the IGI MF would be more steepened by 
the effect described above. Indeed, |pfl amm-Altenburg et al. 
have gone on to argue that this has important implications for the 
mapping between Ha luminosity and star formation rate in dwarf 
galaxies and would lead to the systematic ««der-estimation of the 
SFR in dwarfs. 

Observationally, opinion is strongly divided as to whether 
there is good evidence that the IGIMF is st eeper than S alpete r 
(or if it varies between galax ie s): see e.g. Elmegreenl 1 20061) . 
IPflamm-Altenbur g et al.' ('2007h. ISelman & Melnickl | |2008|) and 
discussion in lClarke (2008i). We may, however, be able to turn this 
question around: (/we can use cluster data to determine whether the 
maximum stellar mass statistics are indeed compatible with random 
drawing models then we can immediately learn whether the IGIMF 
should be equal to the IMF (without recourse to any Galaxy-wide 
or extragalactic data). Although, as we shall see in the following 
Section, it is not straightforward to achieve an unbiased sample for 
analysis, it is obviously attractive to be able to use rather simple, 
local observations to constrain a quantity which is potentially of 
extragalactic significance. 



3 OBSERVATIONAL DATA 

In what follows, we attempt a simple test and enquire: is obser- 
vational data on maximum stellar mass as a function of cluster 
scale compatible with the hypothesis of random drawing from a 
universal IMF (i.e. the same exponent and mass limits for all clus- 
ters)? Incompatibility would have important consequences for the 
IGIMF, as we have seen above and we would then need to en- 
quire what other algorithms could achieve consistency. On the other 
hand, compatibility (at whatever desired significance level) would 
not necessarily imply that random drawing is the 'best fit model' ; 
it would however remove much of the motivation for finding more 
complex alternatives. 

The an alytical mode l for random drawing o f stel lar masses 
(see IOev& Clarke (20051) orHdman & Melnicl^ ( l2008h , and Ap- 
pendix|A]of this paper that recapitulates the main results) is based 
on the expected distribution of maximum stellar masses in the case 
that one makes a given number, n, of selections from a given mass 
function. It therefore makes sense to make n the independent vari- 
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Figure 2. Mass of the most massive star versus the number of stars in the cluster (for better visibility, a small random scatter was applied to the (discrete) 
masses). The data ai'e collected from the literature, with the main sources Testi et al. (4) and Weidner & Kroupa (■). The references for the other points are 
given in Appendix Ib] The solid line is the mean value of rrimax depending on n. The dotted lines follow the 1/6 and 5/6 quantiles, and should confine 2/3rd 
of the observed data. 



able (unlike in the case of mass constrained models where clus- 
ter mass is the obvious choice). In order to obtain a homogeneous 
sample the data are renormalised to a common lower limit be- 
cause of differing observational lower mass limits for each clus- 
ter. Each cluster is designated by the expected number of stars 
that it would contain down to 0.08 M© (i.e. not including brown 
dwarfs), given the observed number and mass limit, and assum- 
ing an I MF for th e missing ranged We employ a two-p art power 
law I MF jKroupj laboi. 2002) with a scaling as m"^'^^ l lSalpeterl 
fl95^ for stars > 1 Mq. In order to compare the completeness 
magnitude of a particular set of observations with the lower mass 
limit of our analys is, we use the conversio n between K magnitude 
and mass given bv lCarpenter et alj l ll993l) (m = io-0-24'"k-+0-24 
or m = iQ-0-2'o-m.H+0A4,y ^.jugters are con-ected (at least in a 
statistical sense) for background or foreground contamination. Fur- 
thermore we demanded that the observed region was large enough 
to contain the whole cluster area. 

As far as the maximum stellar mass is concerned, we either 
use values quoted in the literature or else estima te masses from 
listed spectral types using the conversion given in ISchmidt-Kalerl 
( Il982h (the masses for spectral types not contained in the list being 
interpolated). 

Our criterion for including a cluster is only that we have found 
it to be possible to derive estimates of both n and rrimax in this way. 
As we discuss below, it is unlikely to be either a complete or an un- 
biased sample and this makes any conclusions that we draw from 
this dataset extremely preliminary. Figure [2] compares all the data 
that we have assembled with the predicted centiles of the random 
drawing model (i.e.the mean and the 1/6 and 5/6 contours of the 
cumulative distribution). The data is coded according to source: ■ 

^ Note that the choice of this lower limit is ai'bitrary, provided that it is 
self-consistently applied to all the observational data and to the analytic 
predictions. 



for the data tabulated in Weidner & Kroupa,^ for that obtained by 
Testi et al and • for miscellaneous other observations (see Ap- 
pendix|B). 

One of the hardest aspects of constructing Figure |2]is the as- 
signment of realistic errorbars (in n; errors in mmax are negligible 
by comparison, since we include only clusters which are young 
enough for their most massive members not to have expired as su- 
pernovae). We have drawn one-sided errorbars, on the grounds that 
we are probably missing stars that are located at large distances 
from the most massive star where the density of sources on the sky 
falls below the local background value, either as a result of initial 
conditions or dynamical evolution. We are interested here in the 
total population of stars that was formed with the most massive 
object, irrespective of whether these stars are currently bound to 
the natal cluster. Dynamical evolution in small n clusters can how- 
ever c ause significant expansion over a few Myr l lBonnell & Clarkd 
and this effect increases the likelihood that we may be miss- 
ing stars at large distances. In order to estimate the possible error 
introduced in this way, we really need dynamical simulations on a 
cluster by cluster basis, which limit the range of original configura- 
tions (cluster n and size) that are compatible with the present cen- 
sus of background corrected objects. To our knowle dge, this exer- 
cise h as only been undertaken in one cluster {rj Cha. lMoraux et al] 
l2007h where the total n lies in the range of 18 (as observed) to 
40 (the maximum number that is compatible with leaving a cluster 
with the parameters observed). With this in mind, we add one-sided 
errorbars of a factor of 2 in Figure|2] although note that this is pes- 
simistic (i.e. too large) for the larger n clusters where the rate of 
dynamical dispersion is probably lower. 

An obvious feature of Figure |2] is that different regions of the 
n, mmax plane are populated by clusters obtained through differ- 
ent observing strategies. Evidently, the squares are low compared 
with the centiles, explaining why Weidner & Kroupa found it nec- 
essary to invoke a non-random algorithm for stellar mass selection. 
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The data of Testi et al. is apparently discrepant in the opposite di- 
rection - i.e. maximum stellar masses are, if anything, rather large, 
given the number of stars in the clusters. The reason why these two 
datasets are complementary (and both necessary to the statistical 
analysis) is simply one of the order in which the properties of the 
systems were determined. Weidner & Kroupa sought data on re- 
gions recognised as 'star clusters' and then found the maximum 
recorded mass. Testi et al. instead first identified massive stars (in- 
cluding those that are apparently isolated) and then undertook deep 
infrared imaging of the environs in order to identify any surround- 
ing over-density of low mass stars. Unsurprisingly, the ratio of max- 
imum stellar mass to cluster number is considerably higher in the 
latter case. 



4 ANALYSIS OF THE DATASET 

The observational data contained in Figure |2]is highly incomplete 
and biased and so great care must be taken in its statistical analysis. 
In this section we discuss whether a subset of the data can be used 
to settle whether the results are consistent with the expectations 
of the random drawing hypothesis. We here remind the reader that 
acceptance (rejection) of this hypothesis means that the high mass 
tail of the IGIMF should be identical to (steeper than) the input 
Salpeter IMF. 

If we simply took all these datapoints at face value, we could 
evaluate a cumulative probability for each datapoint (i.e. evaluate 
the probability that the maximum mass is less than or equal to the 
datapoint value, according to the theoretical distribution for that 
particular value of n): if the observations match theoretical expec- 
tations, these probabilies should be uniformly distributed between 
and 1. We can then use a KS test to compare the probability dis- 
tribution with a uniform distribution. The probability that data gen- 
erated through random drawing would be as discrepant from the 
theoretical prediction as is that observed is 10^^^ (adopting the 
membership numbers denoted by symbols in Figure |2]( and 10~* 
(if one instead adopts twice these values, i.e. corresponding to the 
upper end of the errorbars shown in Figure|2](. At face value, there- 
fore, one would overwhelmingly reject the hypothesis of random 
drawing. The reason for the discrepancy (as can be seen from Fig- 
ure|2ll is that the lower range of the cumulative distribution function 
is actually imrfer-populated by the data. 

This conclusion is however highly misleading - the discrep- 
ancy is strongly driven by the very large number of datapoints in 
Figure |2] (from Testi et al.) which populate the upper regions of the 
cumulative distribution. However, it needs to be remembered that 
we simply do not have complete data. 

One way forward is to define a stellar mass, mcpi, such that 
we deem that we have the information on all clusters (of all n), 
for which the maximum mass is > mcpi. By retaining all the data 
with mniax > rricpi, however, we then have data which is complete 
down to different values of the cumulative distribution depending 
on the value of n, which is impractical for a statistical test and 
its interpretation. A more convenient approach is to select the data 
such that they are complete down to the same cumulative proba- 
bility Pcut . By this criterion all data points are selected which lie 
above the mass mcut in) which corresponds to Pcut in the cumula- 
tive distribution of mmax (n) . 

Furthermore, we do not take into account the very small n 
data, because of their presumably large error bars. Thus we in- 
troduce a minimum number of stars in a cluster, n^pi, which are 
needed to ensure the quality of the selected data. The selection pa- 




Observed probabilities 



Figure 3. A plot of the ordered cumulative probabilities versus the observed 
cumulative probabilities derived from the distribution of mmax for a pai- 
ticular n. The shown data fullfill the selection criterion discussed in Section 
|4] i.e. they lie in the dashed wedge in Fig.|2] The filled symbols follow with 
the observed n, whereas for the open symbols 2n was used, corresponding 
to the right end of the en'or bai"s in Fig.|2](H: Weidner & Kroupa; ♦ Testi et 
al.). If the data were uniformly distributed they should follow the diagonal. 



rameter Pcut follows then as the cumulative probability of JTicpi 
for the given ricpi. The choice for the values of m-cpi and ricpi is 
discussed below. 

We demonstrate the selection criterion in Figure[2]for the case 
rricpi — 3Mq , ncpi — 30, where the diagonal dashed line tracks 
the values of mcut(n-) with Pcut = 0.53. By restricting ourselves 
to data within the wedge of the diagram above the dashed line, we 
are obviously not using a lot of the data, but have now defined a 
sample which is complete down to a fixed point in the cumula- 
tive distribution at every value of n included. We can therefore test 
whether these cumulative probabilities are uniformly distributed in 
the range Pcut to 1. 

If we, for the moment, disregard any physical or observational 
ground for choosing particular values of mcpi and ricpi, we can 
use different values of these parameters so as to explore various 
aspects of the two dimensional distribution. As expected, the KS 
probabilities are sensitive to the values of the mcpi, Jicpi adopted, 
with probabilities being higher if the choice is such (e.g. through 
high mcpi or either very high or very low Jicpi) that the number of 
datapoints retained is small. 

We are particularly looking for evidence for a lack of data at 
the upper end of the cumulative distribution (as this would imply 
mmax was less than that implied by random drawing); we are how- 
ever finding in general that this is not a striking feature of the dis- 
tribution. In fact, a significant dearth of high mmax values is only 
recorded if one selects mcpi and ricpi so as to just include the clump 
of datapoints at around IOMq and n ~ 60, since in this case the 
very top of the cumulative distribution is slightly under-represented 
(for example, for mcpi ~ 9Mq and ricpi — 60, there are many 
more datapoints with cumulative probabilities slighly larger than 
Pcut (= 0.75) compared with those at higher values, and this is 
reflected in a relatively low KS probability of around 2%). In gen- 
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eral, however, the feature of the plot that is generally flagged as 
most discrepant by the KS test is not a dearth of the highest values, 
but instead the lack of systems at around 3Mq < mmax < IOMq 
and n ~ 50 — 100, which is readily visible as a data hole in Figure 

El 

It is obviously unsatisfactory if we tune the values of mcpi 
and ricpi so as to retroactively highlight a particular aspect of the 
data. Therefore, as our best guess of plausible parameter values, we 
adopt rricpi = 3Mq and jicpi = 30. The choice of ncpi is motivated 
by the fact that we consider the errorbars in membership number of 
smaller clusters to be very high: both b ecause dynamical evolu- 
tion is more rapid in smaller n systems jBonnell & Clarkelll999l) 
and also because the ejection of even a few stars to radii where 
they cannot be distinguished from the background causes relatively 
large fractional errors in n. The choice of rricpi reflects the mass 
of a moderately luminous Herbig Ae star as targeted by Testi et al 
1998. We have cautioned above that we must not go to much lower 
masses, since young stars of close to solar mass and below have not 
been systematically targeted for surrounding clusters. 

The distributions derived from the data, for this choice of mcpi 
and ncpi, are represented graphically in Figure[3] For each retained 
datapoint, we calculate the position in the theoretical cumulative 
distribution function (plotted on x-axis) and on the y-axis we plot 
the ranked position of the datapoint. (Note that this latter quan- 
tity has been renormalised to lie in the interval [Pcut , 1] instead of 
[0, 1]) The filled symbols use membership numbers denoted by the 
symbols in Figure 2, whereas the open symbols correspond to the 
case when values of n a factor two larger are adopted. Note that 
the number of datapoints are not the same in the two cases, since 
shifting n by a factor of two moves data values in and out of the 
region above the dashed line in Figure 2. If the data conforms to 
the random hypothesis, then the data plotted in Figure 3 should be 
following the diagonal. 

A KS test performed on this data yields a KS probability of 
nearly 20%, implying quite adequate agreement with the random 
drawing hypothesis Q Although both curves in Figure 3 are some- 
what above the diagonal at the uppermost end (implying a mild 
deficit of data values at the top of the predicted cumulative dis- 
tribution function), this discrepancy is not significant in this sam- 
ple (and, as discussed above, the feature in the filled curve that is 
most discrepant is actually the mild deficit of data around ~ 0.6 
in the cumulative distribution, corresponding to the 'data hole' at 
mmax ~ 3 - IOM0, n ~ 50 - 100 in Figure 2). 



5 CONCLUSIONS 

We have high-lighted the difficulty in analysing the data contained 
in Figure|2]owing to difficulties to assigning regions of the diagram 
where the data is believed to be complete. 

Nevertheless, our preliminary conclusion is that we are not 
seeing strong evidence for a systematic suppression in maximum 
stellar mass in small n clusters in addition to that expected on the 
basis of the statistics of random drawing (see also the comple men- 
tary an alysis of the statistics of isolated stars by Parker & Goo dwinI 
( |2007|) . which reached similar conclusions). Indeed, if anything, 

^ We also tested the sensitivity of our results to the mapping employed 
be tween spectra l type and mass by noting that if one uses the calibration 
of lMailins et alj ilOOSi) , two of the O-stars in our sample are signiiicantly 
reduced in mass. This adjustment however does not change the reasonable 
agreement with the null hypothesis. 



the feature of Figure |2] that seems to be most discrepant with the 
random drawing model is the data hole in the range mmax ~ 
3 — IOM0 , n ~ 50 — 100. We are however aware that this might 
indeed be filled in if we have under-estimated the incompleteness 
in smaller n clusters (particularly due to the effects of dynamical 
evolution). 

Our conclusion (in support of the random drawing hypothesis) 
remains provisional. Although we have set out what we believe to 
be a statistically correct methodology for analysing the problem, 
we are highly aware of the difficulties of properly quantifying ob- 
servational selection effects. We therefore seek further input from 
observers in compiling a good sample for this kind of analysis. 
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APPENDIX A: THE DISTRIBUTION OF Mmax 



APPENDIX B: DATA OF THE USED CLUSTERS 



The IMR used in this work is according to iKroupal (l20qil. |2002|) 
with a Salpeter exponent for massive stars dSalpeter 19551 : MassevI 
Il998h . 



m ^''^ TTiMiN < m < 0.5 Mq 
m~^-^^ 0.5 Mq < m < ttimax ' 



(Al) 



where the lower limit ttimin = 0.08 M© and brown dwarfs are 
not included. Since we consider 'pure' random sampling the upper 
limit does not depend on the total mass of the star cluster (ttimax = 
150 Mq). We use the IMR as a probability density, i. e. normalised 
to 

r-"iMAX 

^(m)dm = 1. (A2) 

IN 

In a sample of "identical" star clusters (with the same n) the mass 
of the most massive star in each cluster will not be the same but 
follow its own distribution function. Ror this kind of Monte-Carlo 
experiment the distribution of the most massive star can analyti- 
cally be derived. The probability for the most massive star to lie in 
the mass interval mmax, r 



P{m e [mmax, m 

n 



x,mmax +dmis 
+ dm]) = ^ (mmax) dm. 



(A3) 



All other stars must have a mass smaller than m. The probability to 
pick randomly n — 1 stars from the mass range mMiN, m is 



P(mi., 



G [mMIN,mmax]) = 



5(m')dm') \A4) 



The probability distribution of the most massive star is then the 
product of eqns. |A3| and lA4l multiplied with the factor n because 
every star could be the most massive star. To obtain the probability 
distribution the product has to be differentiated with respect to m. 
This gives 



P(mmax) 



In the compilation o f IWeidner & Kroupal ilOod) the total mass of a 
cluster down to 0.01 M© is given. We converted this mass into 
a membershp number by dividing with the average stellar mass 
(0.36 Mq), and subtracted the expected number of brown dwarfs 
(5(m) oc m-°-^ for 0.01 Mq ^ m < 0.08 Mq) . In the cases 
where numbers are given we used them. 

Rrom the list of Testi et al. we took the Ic values, which are 
background-connected total numbers and range down to the limiting 
magnitude M'f^. One particular star/cluster, MWC 300 (mmax ~ 
5 Mq), is quite distant (15.5 kpc) and has a very high limiting 
mass (« 2.4 Mq). Therefore the observed number of 21 stars is 
corrected to 740 stars. This is the far right outlier in Rig. |2l and 
the correctness of this data point is questionable, but included to be 
complete. 



^(m')dm' J C("T-max) 
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